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gDNA TOR HUMAN MFTHYLSNSTSTRABYPROyOLATE R EDUCTASE 

B ACKGROUND OP THE I NVENTION 

(a) field of Invention 

5 The invention relates to a cDNA probe for human 

methylenetetrahydrofolate reductase (MTHFR) , and its 
uses . 

(b) Description of Prior Art 

Folic acid derivatives are coenzymes for 

10 several critical single-carbon transfer reactions, 
including reactions in the biosynthesis of purines, 
thymidylate and methionine. Methylenetetrahydrofolate 
reductase (MTHFR ; EC 1.5.1.20) catalyzes the NADPH- 
linked reduction of 5 ,10-methylenetetrahydrof olate to 

15 5-methyltetrahydrof olate, a co-substrate for methyla- 
tion of homocysteine to methionine. The porcine liver 
enzyme, a f lavoprotein, has been purified to homogene- 
ity? it is a homodimer of 77-kDa subunits . Partial 
proteolysis of the porcine peptide has revealed two 

20 spatially distinct domains: an N-terminal domain of 40 
kDa and a C-terminal domain of 37 kDa. The latter 
domain contains the binding site for the allos-teric 
regulator S-adenosylmethionine. 

Hereditary deficiency of MTHFR, an autosomal 

25 recessive disorder, is the most common inborn error of 
folic acid metabolism. A block in the production of 
methyltetrahydrof olate leads to elevated homocysteine 
with low to normal levels of methionine. patients 
with severe deficiencies of MTHFR (0 -20% activity in 

30 fibroblasts) can have variable phenotypes. Developmen- 
tal delay, mental retardation, motor and gait abnor- 
malities, peripheral neuropathy, seizures and psychi- 
atric disturbances have been reported in this group, 
although at least one patient with severe MTHFR defi- 

35 ciency was asymptomatic- Pathologic changes in the 
severe form include the vascular changes that have 
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been found in other conditions with elevated homocys^ 
teine, as well as reduced * neurotransmitter and 
methionine levels in the CNS. A milder deficiency of 
MTHFR (35-50% activity) has been described in patients 
5 with coronary artery disease (see below).. Genetic 
heterogeneity is likely, considering the diverse 
clinical features, the variable levels of enzyme 
activity, and the differential heat inactivation pro- 
files of the reductase in patients' cells. 

10 Coronary artery disease (CAD) accounts for 25% 

of deaths of Canadians. Cardiovascular risk factors 
(male sex, family history, smoking, hypertension, 
dyslipoproteinemia and diabetes) account for approxi- 
mately 60 to 70% of our ability to discriminate CAD 

15 patients from healthy subjects- Elevated plasma homo- 
cysteine has also been shown to be an independent risk 
factor for cardiovascular disease. 

Homocysteine is a sulf hydryl-containing amino 
acid that is formed by the demethylation of 

20 methionine. It is normally metabolized to cysteine 
( transsulf uration ) or re-methylated to methionine. 
Inborn errors, of metabolism (as in severe MTHFR defi- 
ciency) causing extreme elevations of homocysteine in 
plasma, with homocystinur ia , are associated with pre- 

25 mature vascular disease and widespread arterial and 
venous thrombotic phenomena. Milder elevations of 
plasma homocysteine (as in mild MTHFR deficiency) have 
been * associated with the development of peripheral 
vascular disease, cerebrovascular disease and 

3 0 premature CAD . 

Homocysteine remethylation to methionine 
requires the folic acid intermediate, 5-methyltetrahy- 
drofolate, which is produced from 5, 10-methylenetet- 
rahydrof olate folate through the action of 5,10- 

35 methylenetetrahydrofolate reductase (MTHFR) . Defi- 
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ciency of MTHFR results in an inability to metabolize 
homocysteine to methionine; elevated plasma homocys- 
teine and decreased methionine are the metabolic con- 
sequences of the block. Severe deficiencies of MTHFR 
5 (less than 20% of activity of controls) as described 
above, are associated with early-onset neurologic 
symptoms (mental ret ardation , peripheral neuropathy , 
seizures, etc.) and with a therosclerotic chan ges and 
thromboembolism. Milder deficiencies of MTHFR (35-50% 

10 of activity of controls), with a thermolabile form of 
the enzyme, are seen in patients with c ardiovascular 
disease without obvious n eurologic abnormalitie s. 

In a survey of 212 patients with proven coro- 
nary artery disease, the thermolabile form of MTHFR 

15 was found in 17% of the CAD group and 5% of controls. 
In a subsequent report on 339 subjects who underwent 
coronary angiography, a correlation was found between 
thermolabile MTHFR and the degree of coronary artery 
stenosis- Again, traditional risk factors (age, sex, 

20 smoking, hypertension, etc.) were not significantly 
associated with thermolabile MTHFR. All the studies on 
MTHFR were performed by enzymatic assays of MTHFR in 
lymphocytes, with measurements of activity before and 
after heat treatment to determine thermo lability of 

2 5 the enzyme. 

Since 5-methyltetrahydrof olate, the product 
of the MTHFR reaction, is the primary form of circula- 
tory -folate, a deficiency in MTHFR might lead to other 
types of disorders. For example, periconceptual folate 

30 administration to women reduces the occurrence and 
recurrence of neural tube defects in their offspring. 
Neural tube defects are a group of developmental mal- 
formations ( meningomyelocele , anencephaly , encephalo- 
cele) that arise due to failure of closure of the neu- 

35 ral tube. Elevated levels of plasma homocysteine have 
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been reported in mothers of children with neural tube 
defects- The elevated plasma homocysteine could be due 
to a deficiency of MTHFR, as described above for car- 
diovascular disease. 
5 Neuroblastomas are tumors derived from neural 

crest cells. Many of these tumors have been reported 
to have deletions of human chromosome region lp3 6, the 
region of the genome to which MTHFR has been mapped. 
It is possible that MTHFR deletions/mutations are 

10 responsible for or contribute to the formation of this 
type of tumor- MTHFR abnormalities may also contribu- 
tion to the formation of other types of tumors, such 
as color ectal tumo rs, since high dietary folate has 
been shown to be inversely associated with risk of 

15 colorectal adenomas, 

MTHFR activity is required for homocysteine 
methylation to methionine. Methionine is necessary for 
the formation of S-adenosylmethionine, the primary 
methyl donor for methylation of DNA, proteins, lipids, 

20 neurotransmitters, etc. Abnormalities in MTHFR might 
lead to lower levels of methionine and S-adenosylme- 
thionine, as well as to elevated homocysteine * Disrup- 
tion of methylation processes could result in a wide 
variety of conditions, such as neoplasias, developmen- 

25 tal anomalies, neurologic disorders, etc. 

Although the MTHFR gene in Escherichia coli 
imetF) has been isolated and sequenced, molecular 
studies of the enzyme in higher organisms have been 
limited without the availability of a eukaryotic cDNA ♦ 

30 In this communication, we report the isolation of a 
human cDNA for MTHFR, its chromosomal assignment, and 
the identification of mutations in MTHFR-def icient 
patients. This report represents the first molecular 
description of mutations in MTHFR deficiency. 
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It would be highly desirable to be provided 
with a cDNA probe for human methylenetetrahydrof olate 
reductase (MTHFR). This probe would be used for iden- 
tification of sequence abnormalities in individuals 
5 with severe or mild MTHFR deficiency, including car- 
diovascular patients and patients with neurologic 
symptoms or tumors. The probe would also be used in 
gene therapy, isolation of the gene, and expression 
studies to produce the MTHFR protein. The probe would 
10 also provide the amino acid sequence of the human 
MTHFR protein, which would be useful for therapy of 
MTHFR deficiency by biochemical or pharmacological 
approaches. 

It would be highly desirable to be provided 
15 with a molecular description of mutations in methyl- 
enetetrahydrof olate reductase deficiency. 

SUMMARY OF THE INVENTION 

One aim of the present invention is to provide 
20 a cDNA probe for human methylenetetrahydrof olate 
reductase (MTHFR) , 

Another aim of the present invention is to pro- 
vide a molecular description of mutations in methyl- 
enetetrahydrof olate reductase deficiency. 
25 Another aim of the present invention is to pro- 

vide a nucleic acid and amino acid sequence for human 
methylenetetrahydrof olate reductase . 

Another aim of the present invention is to pro- 
vide potential therapy for individuals with methyl- 
3 0 enetetrahydrof olate reductase deficiency. 

Another aim of the present invention is to pro- 
vide a system for synthesis of MTHFR protein in vitro. 

A further aim of the present invention is to 
provide for a technology/protocol for identification 
35 of sequence changes in the MTHFR gene. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figs. 1A to IE illustrate the first cDNA coding 
sequence for methylenetetrahydrof olate reductase 
5 { MTHFR ) ; 

Fig. 2 is the alignment of amino acids for 
human methylenetetrahydrof olate reductase (MTHFR), the 
metF genes from E. coli (ECOMETF), and S. typhlmvrium 
( STYMETF ) , and an unidentified open reading frame in 
10 Saccharomyces cerevisiae that is divergently tran- 
scribed from an excision repair gene (ysRADl ) ; 

Figs. 3A and 3B illustrate the sequencing and 
restriction enzyme analysis for the Arg to Ter substi- 
tution; 

15 Figs- 4A and 4B illustrate the sequencing and 

restriction enzyme analysis for the Arg to Gin substi- 
tution; 

Figs. 5A and SB illustrates the sequence change 
and restriction enzyme analysis for the alanine to 
20 ' valine substitution; 

Fig* 6 illustrates the total available sequence 
of human MTHFR cDNA; 

Figs. 7A and 7B illustrate the expression 
analysis of MTHFR cDNA in E. coli, respectively (7A) 
25 the Western blot of bacterial extracts and tissues, 
and (7B) the thermolability assay of bacterial 
extracts; 

Figs. 8A to 8D illustrate the identification of 
a 5* splice site mutation leading to a 57 bp in-frame 
30 deletion of the cDNA; 

Figs. 9A to 9D illustrate the diagnostic 
restriction endonuclease analysis of 4 mutations; 

Figs. 10A to 10D illustrate the ASO hybridiza- 
tion analysis of 2 mutations; and 
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Fig. 11 illustrates the region of homology 
between human methylenetetrahydrof olate reductase 
(MTHFR) and human dihydrof olate reductase (DHFR) . 

5 DETAILED D ESCRIPTION OF THE I NVENTION 

Sequencing of peptides from porcine MTHFR 

Homogeneous native porcine MTHFR was digested 
with trypsin to generate a 40 kDa N-terminal fragment 

10 and a 31 kDa C-terminal fragment; the 31 kDa fragment 
is a proteolytic product of the 37 kDa fragment. The 
fragments were separated by SDS-PAGE, electroeluted, 
and the denatured fragments were digested with lysyl 
endopeptidase (LysC). The resulting peptides were 

15 separated by reversed-phase HPLC and subjected to 
sequence analysis by Edman degradation (details con- 
tained in Goyette P et al . , Nature Genetics, 1994, 
2:195-200) . 

20 Isolation and sequencing of cDNAs 

Two degenerate oligonucleotides were synthe- 
sized based on the sequence of a 30 amino acid porcine 
MTHFR peptide (first underlined peptide in Fig. 2), 
These were used to generate a 90 bp PCR product, 

25 encoding the predicted peptide, from reverse tran- 
scription-PCR reactions of 500 ng pig liver polyA+ 
RNA. A pig-specific (non-degenerate, antisense) PCR 
primer was then synthesized from this short cDKA 
sequence- Using this primer and a primer for phage 

3 0 arms, a human liver XgtlQ cDNA library (Clontech) was 
screened by PCR? this technique involved the genera- 
tion of phage lysate stocks (50,000 pfu) which were 
boiled for 5 mins and then used directly in PCR reac- 
tions with these two primers. PCR fragments were 

35 then sequenced directly (Cycle Sequencing™ kit, 
GIBCO), and a positive clone was identified by com- 
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parison of the deduced amino acid sequence to the 
sequence of the pig peptide (allowing for inter-spe- 
cies variations). The positive stock was then 
replated at lower density and screened with the radio- 
5 labelled positive PCR product by plague hybridization 
until a well-isolated plaque was identified. Phage 
DNA was purified and the insert was then subcloned 
into pBS+ (Bluescript) and sequenced on both strands 
(Cycle Sequencing™ kit, GIBCO and Sequenase™, Pharma- 

10 cia). The deduced amino acid sequence of the human 
cDNA was aligned to the porcine peptide sequences, the 
metF genes from E.coli ( ecometf , accession number 
VO1502) and S. typhimurium {stymetf , accession number 
X07689) and with a previously unidentified open read- 

15 ing frame in Saccharomyces cerevisiae that is diver- 
gently transcribed with respect to the excision repair 
gene, ysRADl (accession number KO2070K The initial 
alignments were performed using BestFit™ in the GCG 
computer package, and these alignments were adjusted 

20 manually to maximize homologies. 

In summary, degenerate oligonucleotide primers 
were designed to amplify a sequence corresponding to a 
30-amino acid segment of a porcine peptide from the N- 
terminal region of the enzyme (first porcine peptide 

25 in Fig. 2). A 90-bp porcine cDNA fragment was 
obtained from reverse transcr iption/PCR of pig liver 
RNA. Sequencing of the PCR fragment confirmed its 
identity by comparison of the deduced amino acid 
sequence to the porcine peptide sequence. A nondegen- 

30- erate oligonucleotide primer, based on the internal 
sequence of the porcine cDNA, was used in conjunction 
with . primers for the phage arms to screen a human 
liver kgtlO cDNA library by PCR * The insert of the 
positive clone was isolated and sequenced. The 
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sequence consisted of 1266 bp with one continuous open 
reading frame. 

Homology with MTHFR in other species 

5 The deduced amino acid sequence of the human 

cDNA was aligned with the metF genes from E.coli and 
S.typhimurium, as well as with a previously unidenti- 
fied ORF in Saccharomyces cerevisxae that is diver- 
gently transcribed with respect to the excision repair 

10 gene, ysRADl (Fig. 2). The sequences homologous to 5 
porcine peptides are underlined in Fig. 2, Three seg- 
ments (residues 61-94, 219-240, and 337-351) corre- 
spond to internal peptide sequence from the N-terrainal 
40 kDa domain of the porcine liver enzyme. Residues 

15 374-393 correspond to the upstream portion of the LysC 
peptide from the C-terminal domain of the porcine 
liver enzyme that is labeled when the enzyme is irra- 
diated with UV light in the presence of ( 3 H- 
raethyl ) AdoMet ; as predicted from the AdoMet labeling 

20 studies, this peptide lies at one end ( N-terminal ) of 
the 3 7 kDa domain. A fifth region of homology 
(residues 359-372) was also identified, but the local- 
ization of the porcine peptide within the native pro- 
tein had not been previously determined. 

2 5 Methylenetetrahydrofolate reductase (MTHFR) is 

an enzyme involved in amino acid metabolism, that is 
critical for maintaining an adequate methionine pool, 
as* well as for ensuring that the homocysteine concen- 
tration does not reach toxic levels. The high degree 
30 of sequence conservation, from E.coli to Homo sapiens, 
attests to the significance of MTHFR in these species. 
The enzyme in E.coli (encoded by the metF locus) is a 
33 kDa peptide that binds reduced FAD and catalyzes 
the reduction of methylenetetrahydrofolate to 

3 5 methyltetrahydrofolate. The metF enzyme differs from 
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the mammalian enzyme in that it cannot be reduced by 
NADPH or NADH , and its activity is not allosterically 
regulated by S-adenosylmethionine . The native porcine 
enzyme is susceptible to tryptic cleavage between the 
5 N-terminal 40 kDa domain and the C-terminal 37 kDa 
domain, and this cleavage results in the loss of 
allosteric regulation by adenosylmethionine, but does 
not result in loss of catalytic activity. Since the 
homology between the bacterial and mammalian enzymes 

10 is within the N-terminal domain, this region must con- 
tain the flavin binding site and residues necessary to 
bind the folate substrate and catalyze its reduction. 
The domain structure of the human enzyme has not been 
elucidated, although the human enzyme has been 

15 reported to have a molecular mass of 150 kDa and is 
likely to be a homodimer of 77 kDa. 

We predict that the point of cleavage between 
the two domains lies between residues 3 51 and 37 4 of 
the human sequence, based on the localization of pep- 

20 tides obtained from the isolated domains of the por- 
cine enzyme. This region, containing the highly- 
charged sequence KRREED, is predicted to have the 
highest hydrophilicity and surface probability of any 
region in the deduced human sequence. 

25 The N-terminus of the porcine protein has been 

sequenced, and the region encoding this part of the 
protein is missing from the human cDNA . We estimate 
that this cDNA is missing only a few residues at the 
N-terminus, since the predicted molecular mass of the 

30 deduced sequence upstream of the putative cleavage 
site (KRREED) is 40 kDa, and the measured molecular 
mass of the porcine N-terminal domain is also 40 kDa. 
When the bacterial, yeast and human sequences are 
aligned, the deduced human sequence contains an N-ter- 

35 minal extension of 40 amino acids; we suspect that 
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this extension contains determinants for NADPH bind- 
ing. Many pyridine nucleotide-dependent oxidoreduc- 
tases contain such determinants at the N-terminus of 
the protein . 

5 The C-terminus of the human sequence contains a 

peptide that is labeled when the protein is irradiated 
with ultraviolet light in the presence of tritiated 
AdoMet . The cDNA sequence we report here contains 
only about 7 kDa of the predicted 37 kDa mass of this 

10 domain , indicating that this cDNA is truncated at the 
3* terminus as well. A number of peptides from the C- 
terminal porcine domain have also not been detected. 
As might be expected, given that the prokaryotic 
enzymes do not appear to be allosterically regulated 

15 by AdoMet, there are no significant homologies between 
the Oterminal region in this cDNA and the prokaryotic 
metF sequences. The alignment shown in Fig. 2 shows 
that the homologous sequences terminate just prior to 
the putative cleavage site of the human enzyme. 

20 ... 

Chromosomal assignment 

In situ hybridization to metaphase human chro- 
mosomes was used for localization of the human gene. 
The analysis of the distribution of 200 silver grains 
25 revealed a significant clustering of grains 40 grains, 
in the p36.3-36. 2 region of chromosome 1 (p<0.0001), 
with the majority of grains, 25 grains, observed over 
lp36,3. 

The isolation of the human cDNA has allowed us 
30- to localize the gene to chromosome lp36.3. The obser- 
vation of one strong signal on that chromosome with 
little background is highly suggestive of a single 
locus with no pseudogenes . Southern blotting of human 
DNA revealed fragments of approximately 10 kb, pre- 
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dieting a gene of average size, since this cDNA 
encodes approximately half of the coding sequence. 

Additional cDNA sequences and constructs for expres- 
5 sion analysis 

A human colon carcinoma cDNA library (gift of 

Dr. Nicole Beauchemin, McGill University) was screened 

by plaque hybridization with the original 1.3 kb cDNA 

to obtain additional coding sequences. A cDNA of 2.2 

10 kb was isolated, which contained 1.3 kb of overlapping 
sequence to the original cDNA and 900 additional bp at 
the 3' end (Fig. 6). The amino acid sequence is iden- 
tical to that of the original cDNA for the overlapping 
segment (codons 1-415) except for codon 177 (ASP) 

15 which was a GLY codon in the original cDNA. Analysis 
of 50 control chromosomes revealed an ASP codon at 
this position. The cDNA has an open reading frame of 
1980 bp, 100 bp of 3 1 UTR and a poly A tail. 

Sequencing was performed on both strands for 

20 the entire cDNA. Additional 5* sequences (800 bp) 
were obtained from a human kidney cDNA library 
(Clontech) but these sequences did not contain addi- 
tional coding sequences and were therefore used for 
the PCR-based mutagenesis only (as described below) 

25 and not for the expression analysis. The two cDNAs 
(2.2 kb and 800 bp) were ligated using the EcoRl site 
at bp 199 and inserted into the Bluescript™ vector 
(Stratagene) . The 2.2 kb cDNA was subcloned into the 
expression vector pTrc99A (Pharmacia) using the Ncol 

30 site at bp 11 and the Xbal site in the polylinker 
region of both the Bluescript™ and the pTrc99A vec- 
tors. Sequencing was performed across the cloning 
sites to verify the wild-type construct. 
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UTILITY OF INVENTION IN IDENTIFICATION OF MUTATIONS 

I. Identification of first two mutations in severe 
MTHFR deficiency 

Total RNA of skin fibroblasts from MTHFR-defi- 
5 cient patients was reverse-transcribed and amplified 
by PCR for analysis by the single strand conformation 
polymorphism (SSCP) method (Orita, M. et al . , Genom- 
ics, 1989, 5:8874-8879)- Primers were designed to 
generate fragments of 250-300 bp and to cover the 

10 available cDNA sequences with small regions of overlap 
for each fragment at both ends. The first mutation 
identified by SSCP was a C to T substitution at bp 559 
in patient 1554; this substitution converted an 
arginine codon to a termination codon (Fig. 3A) . 

15 Since the mutation abolished a Fokl site, restriction 
digestion was used for confirmation of the change and 
for screening additional patients for this mutation; a 
second patient (1627) was identified in this manner 
(Fig. 3B). The SSCP pattern for patient 1554 and the 

20 . restriction digestion pattern for both patients was 
consistent with a homozygous mutant state or with a 
genetic compound consisting of the nonsense mutation 
with a second mutation that did not produce any 
detectable RNA (null allele)* Studies in the parents 

25 are required for confirmation. 

The second substitution (Fig. Ah) was a G to A 
transition at bp 482 in patient 1834 that converted an 
arginine into a glutamine residue. The . substitution 
created a Pstl site which was used to verify the sub- 

30 stitution and to identify a second patient (1863) with 
this change (Fig. 4B). The SSCP analysis and the 
restriction digestion pattern were consistent with a 
heterozygous state for both patients. The arginine 
codon affected by this change is an evolutionarily- 

35 conserved residue, as shown in Fig* 2. This observa- 
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tion, in conjunction with the fact that the codon 
change is not conservative, makes a strong argument 
that the substitution is a pathologic change rather 
than a benign polymorphism* Furthermore, 3 5 controls 
5 (of similar ethnic background to that of the probands) 
were tested for this substitution by Southern blotting 
of Pstl-digested DNA; all were negative. 

The family of patient 1834 was studied. The 
symptomatic brother and the mother of the proband were 
10 all shown to carry this substitution, whereas the 
father was negative for the change (Fig. 4B). In the 
family of 1863 , the mother of the proband was shown to 
be a carrier, while the father and an unaffected 
brother were negative. 

15 

Cell lines 

Cell line 1554 is from a Hopi male who was 
admitted at age three months with homocystinuria, sei- 
zures/ dehydration/ corneal clouding, hypotonia and 

20 .Candida sepsis. Folate distribution in cultured 
fibroblasts showed a Pediococcus 

cerivisiae/Lactobacillus cased (PC/LC) ratio of 0.52 
(Control 0.14). There was no measurable methyl- 
enetetrahydrof olate reductase (MTHFR) activity 

25 (Control values = 9,7 and 15.1 nmoles/h/mg protein ; 
residual activity after treatment of control extracts 
at 55°C for 20 min. = 28% and 31%). 

Cell line 1627 is from a Choctaw male who pre- 
sented with poor feeding, apnea, failure to thrive, 

30 dehydration and homocystinuria at five weeks of age. 
He was subsequently found to have superior sagittal 
sinus thrombosis and hydrocephalus. The PC/LC ratio 
was 0.61 and the specific activity of MTHFR was 0-1 
nmoles/h/mg protein. There is consanguinity in that 
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the maternal and paternal grandmothers are thought to 
be "distantly related". 

Cell line 1779 is from a French Canadian male 
with homocystinuria who first had limb weakness, inco- 
5 ordination , paresthesiae , and memory lapses at age 15 
years, and was wheelchair-bound in his early twenties. 
His brother (cell line 1834) also has homocystinuria/ 
but is 37 years old and asymptomatic. Specific activ- 
ity of MTHFR was 0.7 and 0.9 nmole/h/mg protein for 

10 1779 and 1834, respectively; the residual activity 
after heat treatment at 55°C was 0.9% and 0% for 1779 
and 1834, respectively. 

Cell line 1863 is from a white male who was 
diagnosed at age 21 years because of a progressive 

15 gait disturbance, spasticity, cerebral white matter 
degeneration, and homocystinuria. He had a brother 
who died at age 21 years of neurodegenerative disease. 
Specific activity of MTHFR in fibroblast extracts was 
1.76 nmoles/h/mg protein and the residual enzyme ac- 

20 - tivity after treatment at 55°C was 3.6%. 

Mutation analysis 

Primers were designed from the cDNA sequence to 
generate 250-300 bp fragments which overlapped 50- 

25 7 5 bp at each end. The primer pairs were used in 
reverse transcription-PCR of 5ug patient total fibro- 
blast RKA. The PCR products were analyzed by a non- 
isotopic rapid SSCP protocol ( PhastSystem™, Pharma- 
cia), which uses direct silver staining for detection 

30 of single strands. Any PCR products from patients 
showing a shift on SSCP gels were purified by NuSieve 
(FMC Bioproducts) and sequenced directly (Cycle 
Sequencing™ kit, GIBCO) to identify the change. If 
the change affected a restriction site, then a PCR 

35 product was digested with the appropriate restriction 
endonuclease and analyzed on polyacrylamide gels. To 
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screen for the Arg to Gin mutation in controls, 5 
of Pstl-digested DNA was run on 0.8% agarose ge-.s and 
analyzed by Southern blotting using the radiolabeled 
cDNA by standard techniques* 

5 

II. Seven additional mutations at the methylene- 
tetrahydrofolate reductase < MTHFR) locus with 
genotype: phenotype correlations in severe MTHFR 
deficiency 

10 

It is reported hereinbelow the characterization 
of 7 additional mutations at this locus: 6 missense 
mutations and a 5* splice site defect which activates 
a cryptic splice site in the coding sequence. We also 

15 present a preliminary analysis of the relationship 
between genotype and phenotype for all 9 mutations 
identified thus far at this locus. A nonsense muta- 
tion and 2 missense mutations (proline to leucine and 
threonine to methionine) in the homozygous state are 

20 associated with extremely low activity (0-3%) and 
onset of symptoms within the first year. Other mis- 
sense mutations (arginine to cysteine and arginine to 
glutamine) are associated with higher enzyme activity 
and later onset of symptoms. 

25 7 additional mutations at the MTHFR locus are 

described and the association between genotype, enzyme 
activity, and clinical phenotype in severe MTHFR defi- 
ciency is examined- 

30 Patient description 

The clinical and laboratory findings of the 
patients have been reported in the published litera- 
ture, Residual MTHFR activity was previously measured 
in cultured fibroblasts at confluence. 
35 Patient 354, an African-American girl, was 

diagnosed at age 13 years with mild mental retarda- 
tion. Her sister, patient 355 was diagnosed at age 15 
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years with anorexia, tremor, hallucinations and pro- 
gressive withdrawal* In patient 354, residual MTHFR 
activity was 19% and in her sister, 355, it was 14% of 
control values. The residual activity after heating 
5 had equivalent thermal stability to control enzyme. 

Patient 1807, a Japanese girl whose parents are 
first cousins, had delayed walking and speech until 
age 2 years, seizures at age 6 years and a gait dis- 
turbance with peripheral neuropathy at age 16 years, 

10 Residual activity of MTHFR was 3% and the enzyme was 
thermo labile. 

Patient 735, an African-Indian girl, was diag- 
nosed at . age 7 months with microcephaly, progressive 
deterioration of mental development, apnea and coma. 

15 Residual activity of MTHFR was 2% of control levels. 
Thermal properties were not determined. 

Patient 1084, a Caucasian male, was diagnosed 
at age 3 months with an infantile fibrosarcoma- He 
was found to be hypotonic and became apneic . He died 

20 at the age of 4 months. Residual activity of MTHFR 
was not detectable. Thermal properties were not 
determined. 

Patient 356, the first patient reported with 
MTHFR deficiency, is an Italian-American male who pre- 
25 sented at age 16 years with muscle weakness, abnormal 
gait and flinging movements of the upper extremities. 
MTHFR residual activity was 20% of control values; 
activity was rapidly and exponentially inactivated at 
55°. 

30 Patient 458, a Caucasian male, was diagnosed at 

age 12 years with ataxia and marginal school perform- 
ance. Residual MTHFR activity was approximately 10%, 
and the activity was thermolabile . 

Patient 1396, a Caucasian female, was described 

35 as clumsy and as having a global learning disorder in 
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childhood. At age 14 years, she developed ataxia, 
foot drop, and inability to walk. She developed deep 
vein thrombosis and bilateral pulmonary emboli- Resid- 
ual activity of MTHFR was 14% and the enzyme was ther- 
5 molabile. 

Genomic structure and intronic primers 

Exon nomenclature is based on available cDNA 
sequence in Goyette et al. (Nature Genetics, 1994, 

10 1:195-200). Exon 1 has been arbitrarily designated as 
the region of cDNA from bp 1 to the first intron. 
Identification of introns was performed by amplifica- 
tion of genomic DNA using cDNA primer sequences. PCR 
products that were greater in size than expected cDNA 

15 sizes were sequenced directly. 

Mutation detection 

Specific exons (see Table 1 for primer 
sequences) were amplified by PCR from genomic DNA and 

20 . analyzed by the SSCP protocol. SSCP was performed 
with the Phastgel™ system (Pharmacia), a non-isotopic 
rapid SSCP protocol, as previously described (Goyette 
P et al., Nature Genetics, 1994 , j: 195-20,0 ) , or with 
35 S~labeled PCR products run on 6% acrylamide: 10% 

25 glycerol gels at room temperature (6 watts, over- 
night). In some cases, the use of restriction endonu- 
cleases, to cleave the PCR product before SSCP analy- 
sis, enhanced the detection of band shifts. PCR frag- 
ments with altered mobility were sequenced directly 

30 (GIBCO, Cycle Sequencing™ kit). If the sequence 
change affected a restriction endonuclease site, then 
the PCR product was digested with the appropriate 
enzyme and analyzed by PAGE. Otherwise, allele-spe- 
cific oligonucleotide (ASO) hybridization was per- 

35 formed on a dot blot of the PCR-amplif ied exon. 
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Table 1 

PCR Primers for DNA amplification and mutation 
analysis of MTHFR 



Exon 


Primer Type 


Primer Sequence <6'-*3') 


Location 


Fragment 
Size (bp) 


1 


Sense 

Antisense 


AGCCTCAACCCCTGCTTGGAGG 
TGACAGTTTGCTCCCCAGGCAC 


C 
I 


271 


4 


Sense 
Antisense 


TGAAGGAGAAGGTGTCTGCGGGA 
AGGACGGTGCGGTGAGAGTGG 


c 
I 


198 


5 


Sense 
Antisense 


CACTGTGGTTGGCATGGATGATG 
GGCTGCTCTTGGACCCTCCTC 


I 
I 


392 


6 


Sense 
Antisense 


TGCTTCCGGCTCCCTCTAGCC 
CCTCCCGCTCCCAAGAACAAAG 


I 


251 



Table 2 

Summary of genotypes, enzyme activity, age at onset, 
and background of patients with MTHFR deficiency 



Patient a BPChange* D Amino acid % Age at Onset Background 
changes Activity — 



1607 


C764T/C764T 


Pro-> Leu/Pro-^ Leu 


3 


within 1st year 


Japanese 


735 


C692T/C692T 


Thr-»Met/Thr-*Met 


2 


7 months 


African Indian 


1084 


C692T/C692T 


Thr->Met/Thr-»Met 


0 


3 months 


Caucasian 


1654 


C559T7C559T 


Arg-*Ter/Arg-*Ter 


0 


1 month 


Native American 
(Hopi) 


1627 


C559T/C559T 


Arg-*Ter/Arg->Ter 


1 


1 month 


Native American 
(Choctaw) 


356 


C985T/C985T 


Arg->Cys/Arg-*Cys 


20 


16 yrs 


Italian American 


458 


C1015T/G167A 


Arg->Cys/Arg-»Gln 


10 


11 yrs 


Caucasian 


1396 


C1081T/G167A 


Arg->Cys/Aro->Gln 


14 


14 yrs 


Caucasian 


1779 C 


G482A/7 


Arg->G(n/7 


6 


15 yrs 


French Canadian 


1B34 C 


G482A/7 


Arg-»GlrV? 


7 


Asymptomatic 
at 37 yrs 


French Canadian 


1863 


G482A/7 


Arg->Gln/? 


14 


21 yrs 


Caucasian 


354 d 


792 + 1G-»A/7 


5' splice site/? 


19 


13 yrs 


African American 


355 d 


792 + 1 G-»A/? 


5' splice site/7 


14 


1 1 yrs 


African American 



10 a Patients 1554, 1627, 1779, 1834 and 1863 were previously reported by Goyette et al. (1994). 
b ? = unidentified mutation. 
c Patients 1779 and 1 834 are sibs. 
d Patients 354 and 355 are sibs. 
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( 1 ) 5' splice site mutation 

Amplification of cDNA, bp 653-93 9, from 
reverse-transcribed total fibroblast RNA revealed 2 
5 bands in sisters 354 and 355: a smaller PCR fragment 
(230 bp) in addition to the normal 287 bp allele 
(Fig. 8A) - Fig. 8A is the PAGE analysis of amplifica- 
tion products of cDNA bp 653-939, from reverse tran- 
scribed RNA. Controls have the expected 287 bp frag- 

10 ment while patients 354 and 355 have an additional 230 
bp fragment. Sequencing of the smaller fragment iden- 
tified a 57 bp in-frame deletion which would remove 19 
amino acids {Fig* 8BK Fig, 8B is the direct sequenc- 
ing of the PCR products from patient 354. The 57 bp 

15 deletion spans bp 736-792 of the cDNA. An almost per- 
fect 5* splice site (boxed) is seen at the 5' deletion 
breakpoint. Analysis of the sequence at the 5' dele- 
tion breakpoint in the undeleted fragment revealed an 
almost perfect 5 1 splice site consensus sequence 

20 • (AG/gcatgc). This observation suggested the presence 
of a splicing mutation in the natural 5' splice site 
that might activate this cryptic site, to generate the 
deleted allele. The sequence following the deletion 
breakpoint, in the mutant allele, corresponded exactly 

25 to the sequence of the next exon. Amplification of 
genomic DNA, using the same amplification primers as 
those used for reverse-transcribed RNA, generated a 
1-2 kb PCR product indicating the presence of an 
intron. Direct sequencing of this PCR fragment in 

30 patient 354 identified a heterozygous G~-»A substitution 
in the conserved GT dinucleotide of the intron at the 
5' splice site (Fig. 8C). Fig. 8C is the sequencing 
of the 5' splice site in control and patient 354. The 
patient carries a heterozygous G— >h substitution in the 

35 5' splice site (boxed). Intronic sequences are in 
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lower case. This substitution abolished a HphI 
restriction endonuclease site which was used to con- 
firm the mutation in the 2 sisters (Fig, 8D). Fig. 8D 
is the HphI restriction endonuclease analysis on PCR 
5 products of DNA for exon 4 of patients 354 and 355, 
and of 3 controls <C). The 198 bp PCR product has 2 
HphI sites. The products of digestion for the control 
allele are 151, 24 and 23 bp. The products of diges- 
tion for the mutant allele are 175 and 23 bp due to 
10 the loss of a HphI site. The fragments of 24 and 23 bp 
have been run off the gel. 

(2) Patients with homozygous coding substitutions 

SSCP analysis of exon 4 for patient 1807 

15 revealed an abnormally-migrating fragment, which was 
directly sequenced to reveal a homozygous C-^T substi- 
tution <bp 764) converting a proline to a leucine 
residue v This change creates a Mnll restriction endo- 
nuclease site, which was used to confirm the homozy- 

20 • gous state of the mutation (Fig. 9A). Fig. 9A is the 
Mnll restriction analysis of exon 4 PCR products for 
patient 1807 and 3 controls (C). Expected fragments: 
control allele, 90, 46, 44, 18 bp; mutant allele, 73, 
46, 44, 18, 17 bp. An additional band at the bottom 

25 of the gel is the primer. Fifty independent control 
Caucasian chromosomes and 12 control Japanese chromo- 
somes were tested by restriction analysis; all were 
negative for this mutation. Homozygosity in this pa- 
tient is probably due to the consanguinity of the 

3 0 parents. 

Patients 735 and 1084 had the same mutation in 
exon 4, in a homozygous state: a C-»T substitution (bp 
692) which converted an evolutionarily-conserved 
threonine residue to a methionine residue, and abol- 
35 ished a Attain restriction endonuclease site. Allele- 



WO 95/33054 



PCT/CA95/00314 



specific oligonucleotide hybridization to amplified 
exon 4 (Figs. 10A and 10B) was used to confirm the 
mutation in these 2 patients and to screen 60 inde- 
pendent chromosomes, all of which turned out to be 
5 negative. Fig. 10A is the hybridization of mutant 
oligonucleotide (692T) to exon 4 PGR products from 
patients 735, 1084 and 30 controls- Only DNA from 
patients 735 and 1084 hybridized to this probe. 
Fig. 10B is the hybridization of normal oligonucleo- 

10 tide (692C) to stripped dot blot from A. All control 
DNAs hybridized to this probe. 

Patient 356 showed a shift on SSCP analysis of 
exon 5- Direct sequencing revealed a homozygous C-*T 
substitution (bp 985) which converted an evolutionary 

15 ily-conserved arginine residue to cysteine; the sub- 
stitution abolished an Acil restriction endonuclease 
site. This was used to confirm the homozygous state 
of the mutation in patient 356 (Fig. 9B) and its pres- 
ence in the heterozygous state in both parents. Fifty 

20 * independent control chromosomes, tested in the same 
manner, were negative for this mutation- Fig. 9B is 
the Acil restriction analysis of exon 5 PCR products 
for patient 356, his father (F), his mother (M), and 3 
controls (C). Expected fragments: control allele, 

25 129, 105, 90, 68 bp; mutant allele, 195, 129, 68 bp. 

( 3.) Patients who are genetic compounds 

Patient 458 is a compound heterozygote of a 
mutation in exon 5 and a mutation in exon 1. The exon 

30 5 substitution <C— »T at bp 1015) resulted in the sub- 
stitution of a cysteine residue for an arginine resi- 
due; this abolished a Hhal restriction endonuclease 
site, which was used to confirm the mutation in 
patient 458 (Fig, 9C) and to show that 50 control 

35 chromosomes were negative. Fig. 9C is the Hhal 
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restriction analysis of exon 5 PCR products for 
patient 458 and 4 controls (C). Expected fragments: 
control allele, 317 and 75 bp; mutant allele 392 bp. 
The 7 5 bp fragment is not shown in Fig. 9C. The 
5 second mutation was a heterozygous G-»A substitution 
(bp 167) converting an arginine to a glutamine resi- 
due, Allele-specif ic oligonucleotide hybridization to 
amplified exon 1 confirmed the heterozygous state of 
this mutation in patient 458 and identified a second 

10 patient (1396) carrying this mutation also in the 
heterozygous state (Figs- 10C and 10D). Fig, 10C is 
the hybridization of mutant oligonucleotide (167A) to 
exon 1 PCR products from patients 458, 1396 and 31 
controls. Fig- 10D is the hybridization of normal 

15 oligonucleotide (167G) to stripped dot blot from C. 
None of the 62 control chromosomes carried this muta- 
tion. 

The second mutation in patient 1396 was identi- 
fied in exon 6: a heterozygous C->T substitution (bp 

20 1081) that converted an arginine residue to a cysteine 
residue, and abolished a Hhal restriction endonuclease 
site. Restriction analysis confirmed the heterozygous 
substitution in 1396 (Fig, 9D) and showed that 50 con- 
trol chromosomes were negative. Fig. 9D is the Hhal 

25 restriction analysis of exon 6 PCR products for 
patient 1396 and 2 controls (C). Expected fragments: 
control allele, 152, 86, 13 bp; mutant allele 165, 86 
bp. The 13 bp fragment has been run off the gel. 

30 (4) Additional sequence changes 

Hhal analysis of exon 6, mentioned above, 
revealed a DNA polymorphism. This change is a T— >C 
substitution at bp 1068 which does not alter the amino 
acid (serine), but creates a Hhal recognition site. T 

35 at bp 1068 was found in 9% of tested chromosomes. 



WO 95/33054 



PCT/CA95/00314 



Sequence analysis identified 2 discrepancies with the 
published cDN/ sequence: a G-»A substitution at bp 542 
which converts the glycine to an aspartate codon, and 
a C-»T change at bp 1032 which does not alter the amino 
5 acid (threonine). Since all DNAs tested (>50 chromo- 
somes) carried the A at bp 542 and the T at bp -1032/ 
it is likely that the sequence of the original cDNA 
contained some cloning artifacts. 

10 Genotype :pheno type correlation 

Table 2 summarizes the current status of muta- 
tions in severe MTHFR deficiency. In 8 patients, both 
mutations have been identified; in 5 patients (3 fami- 
lies), only 1 mutation has been identified. Overall 

IS the correlation between the genotype, enzyme activity, 
and phenotype is quite consistent. Five patients, 
with onset of symptoms within the first year of life, 
had < 3% of control activity. Three of these patients 
had missense mutations in the homozygous state: two 

20 . patients with the threonine to methionine substitution 
(C692T) and one patient with the proline to leucine 
substitution (C764T). The nonsense mutation (C559T) 
in the homozygous state in patients 1554 and 1627 
(previously reported in Goyette P et al., Nature 

25 Genetics, 1994, 2 : 195-200) is also associated with a 
neonatal severe form, as expected. 

The other patients in Table 2 had >, 6% of con- 
trol activity and onset of symptoms within or after 
the 2nd decade of life; the only exception is patient 

30 1834, as previously reported (Goyette P et al., Nature 
Genetics, 1994, 7:195-200). The three patients (356, 
458 and 1396) with missense mutations (G167A, C985T, 
C1015T and C1081T) are similar to those previously 
reported (patients 1779, 1834 and 1863 ) who had an 

35 arginine to glutamine substitution and a second 
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unidentified mutation (Goyette P et al., Nature Genet- 
ics, 1994, 7:195-200) The sisters with the 5* splice 
mutation and an unidentified second mutation also had 
levels of activity in the same range and onset of 
5 symptoms in the second decade, but the activity is 
likely due to the second unidentified allele. 

Discussion 

The patients come from diverse ethnic back- 

10 grounds. Although patients 1554 and 1627 are both 
Native Americans , the mutations occur on different 
haplotypes, suggesting recurrent mutation rather than 
identity by descent. Since the substitution occurs in 
a CpG dinucleotide, a "hot spot" for mutation, recur- 

15 rent mutation is a reasonable hypothesis. It is dif- 
ficult to assess whether some mutations are popula- 
tion-specific since the numbers are too small at the 
present time. 

MTHFR deficiency is the most common inborn 

20 error of folate metabolism, and a major cause of 
hereditary homocysteinemia . The recent isolation of a 
cDNA for MTHFR has permitted mutational analysis at 
this locus, with the aims of defining important 
domains for the enzyme and of correlating genotype 

25 with phenotype in MTHFR-def icient patients. 

Our definition of a disease-causing substitu- 
tion, as distinct from a benign polymorphism, is based 
on 3 factors: (1) absence of the change in at least 50 
independent control chromosomes; (2) presence of the 

30 amino acid in the bacterial enzyme, attesting to its 
evolutionary significance and (3) whether the change 
in amino acid is conservative. Although expression of 
the substitutions is required to formally prove that 
they are not benign, the criteria above allow us to 
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postulate that the changes described in this report 
are likely to affect activity. 

The 7 mutations described here (6 single amino 
acid substitutions and a 5' splice site mutation) 
5 bring the* total to 9 mutations identified thus far in 
severe MTHFR deficiency and complete the mutation 
analysis for 8 patients. The identification of each 
mutation in only one or two families points to the 
striking degree of genetic heterogeneity at this 
10 locus. Seven of the 9 mutations are located in CpG 
dinucleotides, which are prone to mutational events. 

5 ' splice site mutation 

The G->A substitution at the GT dinucleotide of 

15 the 5' splice site in patients 3 54 and 3 55 results in 
a 57bp in-frame deletion of the coding sequence, which 
should delete 19 amino acids of the protein. This 
deletion occurs as a result of the activation of a 
cryptic 5* splice site (AG/gc) even though this cryp- 

20 tic site does not have a perfect 5' splice site con- 
sensus sequence (AG/gt) . However, GC (instead of GT) 
as the first 2 nucleotides of an intron have been 
reported in several naturally-occurring splice sites, 
such as in the genes for human prothrombin and human 

25 adenine phosphoribosyltransf erase and twice within the 
gene for the largest subunit of mouse RNA polymerase 
II. The remaining nucleotides of the cryptic site 
conform to a normal splice site consensus sequence 
with its expected variations < A 60 

30 G79/g 10 otiooa59a 71 g82t50 > • 11 is unlikely that the 
deleted enzyme resulting from this aberrantly-spliced 
mKNA would have any activity; 8 of the 19 deleted 
amino acids are conserved in the E* coli enzyme. 
Although the 2 patients show residual enzyme activity 

35 in the range of 20% of controls, the activity is prob- 
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ably due to the unidentified second allele in these 
patients, - 

6 mis sense mutations 

5 The Arg->Cys substitution (C1081T) in patient 

13 96 is within a hydrophilic sequence previously pos- 
tulated to be the linker region between the catalytic 
and regulatory domains of MTHFR (Goyette P et al., 
Nature Genetics, 1994, 2:195-200). These 2 domains 

10 are readily separable by mild trypsiniaation of the 
porcine enzyme. The linker domain, a highly-charged 
region, is likely to be located on the outside surface 
of the protein and therefore more accessible to prote- 
olysis. Because the Arg-»Cys substitution converts a 

15 charged hydrophilic residue to an uncharged polar 
residue, it cannot be considered a conservative 
change, and could affect the stability of the enzyme. 

The 2 Arg— >Cys substitutions identified in 
patients 356 and 458 (C985T and C1015T, respectively) 

20 may be involved in binding the FAD cof actor. Previ- 
ous work in the literature showed that heating fibro- 
blast extracts at 55°, in the absence of the FAD 
cof actor, inactivated MTHFR completely. The addition 
of FAD to the reaction mixture before heat inactiva- 

25 tion restored some enzyme activity to control extracts 
and to extracts from some patients, while the extracts 
of patients 356 and 458 were unaffected. Based on 
these observations, it was suggested that these 2 
patients had mutations affecting a region of the pro- 

30 tein involved in binding FAD. The 2 mutations are 
found in close proximity to each other, within 11 
amino acids. In patient 356, the Arg residue is evo- 
lutionarily-conserved in E. coli and is found in a 
stretch of 9 conserved amino acids, suggesting a 

35 critical role for this residue; the altered Arg resi- 
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due in patient 458 is not evolutionarily-conserved. 
Crystal structure analysis of medium chain acyl-CoA 
dehydrogenase (MCAD), a f lavoprotein , has defined 
critical residues involved in the binding of FAD. Two 
5 consecutive residues of the MCAD protein, Metl65 and 
Trpl66, involved in interactions with FAD , can also be 
identified in MTHFR , 3 and 4 amino acids downstream, 
respectively, from the Arg residue altered in patient 
458. 

10 The Thr^Met substitution (C692T) , is found in 

a region of high conservation with the E* cold enzyme 
and in a region of good homology with human dihydro- 
folate reductase (DHFR) (Fig* 11). In Fig. 11, = is 
identity; ♦ is homology; and o is identity to bovine 

15 DHFR enzyme. An asterisk (*) indicates location of 
Thr->Met substitution. Considering the early-onset 
phenotype of the patients, one can assume that the 
threonine residue is critical for activity or that it 
contributes to an important domain of the protein. 

20 . This region of homology in DHFR contains a residue, 
Thrl3 6, which has been reported to be involved in 
folate binding. This Thr residue in DHFR aligns with 
a Ser residue in MTHFR, an amino acid with similar 
biochemical properties. The Thr — > Met substitution is 

25 located 8 amino acids downstream from this Ser codon, 
in the center of the region of homology between the 2 
enzymes. We therefore hypothesize that the Thr — > Met 
substitution may alter the binding of the folate sub- 
strate, 

30 , The G167A (Arg -» Gin) and C764T (Pro -> Leu) 

substitutions both affect non-conserved amino acids. 
Their, importance in the development of MTHFR defi- 
ciency cannot be determined at the present time. All 
the mutations identified thus far are located in the 

35 5 1 end of the coding sequence, the region thought to 
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encode the catalytic domain of MTHFR . Mutation analy- 
sis has been useful in beginning to address the struc- 
ture: function properties of the enzyme as well as to 
understand the diverse phenotypes in severe MTHFR 
5 deficiency. 

Ill- Identification of A-»V mutation 

SSCP analysis and direct sequencing of PCR 
fragments were used to identify a C to T substitution 

10 at bp 677, which converts an alanine residue to a 
valine residue (Fig. 5A). The primers for analysis of 
the A->V change are: 5 * -TGAAGGAGAA GGTGTCTGCG GGA-3 ' 
( exonic ) and 5 1 -AGGACGGTGC GGTGAGAGTG-3 ' ( intronic ) ; 
these primers generate a fragment of 198 bp. Fig. 5A 

15 depicts the sequence of two individuals, a homozygote 
for the alanine residue and a homozygote for the 
valine residue. The antisense strands are depicted. 
This alteration creates a Hi nil site (Fig. 5B), which 
was used to screen 11.4 unselected French Canadian 

20 chromosomes; the allele frequency of the substitution 
was .38. The substitution creates a Hinfl recognition 
sequence which digests the 198 bp fragment into a 17 5 
bp and a 23 bp fragment; the latter fragment has been 
run off the gel. Fig. 5B depicts the three possible 

25 genotypes. The frequency of the 3 genotypes were as 
follows: -/-, 37%; +/-, 51%; and +/+, 12% (the ( + ) 
indicates the presence of the Hinfl restriction site 
and a valine residue). 

The alanine residue is conserved in porcine 

30 MTHFR , as well as in the corresponding metF and 
stymetF genes of E. coli and 5. typhimurium, respec- 
tively. The strong degree of conservation of this 
residue, and its location in a region of high homology 
with the bacterial enzymes, alluded to its importance 

35 in enzyme structure or function- Furthermore, the 
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frequency of the (+/+)' genotype was consistent with 
the frequency of the thermolabile MTHFR variant impli- 
cated in vascular disease, 

5 Clinical material 

To determine the frequency of the A— >V muta- 
tion, DNA from 57 individuals from Quebec was analyzed 
by PCR and restriction digestion. The individuals, 
who were all French Canadian/ were not examined clini- 

10 cally or biochemically. 

The 40 individuals analyzed in Table 3 had been 
previously described in Engbersen et al. {Am. J. Hum. 
Genet., 1995, -5£: 142-150 ) . Of the 13 cardiovascular 
patients, 8 had cerebrovascular arteriosclerosis and 5 

15 had peripheral arteriosclerosis. Five had thermo- 
labile MTHFR while 8 had thermostable MTHFR (greater 
than 33% residual activity after heating). Controls 
and patients were all Dutch-Caucasian, between 20-60 
years of age. None of these individuals used vitamins 

20 , which could alter homocysteine levels . Enzyme assays 
and homocysteine determinations were also reported by 
Engbersen et al. (Am. J. Hum. Genet., 1995, 56 : 142- 
150 ) . 
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Table 3 

Correlation between MTHFR genotype and enzyme activ- 
ity, tbermolability and plasma homocysteine level 

5 _ 





-/- 




+/+ 




n-19 




n=12 


specific activity 3 '** 


22.9± 1 .7 


15.0 ±0.8 


6.9 ±0.6 


(nmol CH20/mg.protein/hr) 


(11.8-33.8) 


(10.2-18,8) 


(2.6-10.2) 


residual activity after 


66,8 ± 15 


56.2 ±2.8 


21.8 ±2.8 


heating a » b (%) 


(56-76) 


(41-67) 


(10-35) 


plasma homocysteine a » c 


12.6 ±1.1 


13.8 ±1,0 


22.4 ±2.9 


(pM)(after fasting) 


(7-21) 


(9.6-20) 


(9.6*42) 


plasma homocysteine a » c 


41.3±5.0 d 


41 ±2.8 


72.6 ±11.7® 


(MM)(post*methionine load) 


(20.9-110) 


(29.1-54) 


(24.4-159) 



3 one-way anova p<.01 

D paired t test for all combinations p<.01 

c paired t test p<,05 for +/+ group versus +/- group or group; p>.05 for +/- versus 
-A group. 
10 d n-1 8 for this parameter 
e n-11 for this parameter 

Enzyme activity and plasma homocysteine were 
determined as previously reported. Each value repre- 
sents mean ± standard error. The range is given in 
15 parentheses below the mean. 

Correlation of A-»V mutation with, altered MTHFR func- 
tion 

A genotypic analysis was performed and enzyme 
20 activity and thermolability were measured in a total 
of 40 lymphocyte pellets from patients with premature 
vascular disease and controls. 13 vascular patients 
W ere selected from a previous study (Engbersen etal., 
Am. J. Hum. Genet., 1995, 56.: 142-150 ) , among which 5 
25 were considered to have thermolabile MTHFR . From a 
large reference group of 89 controls, all 7 
individuals who had thermolabile MTHFR were studied, 
and an additional 20 controls with normal MTHFR were 
selected from the same reference group. Table 3 
30 documents the relationship between genotypes and spe- 
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cific enzyme activity, thermolabxlity and plasma homo* 
cysteine level • The mean MTHFR activity for individu- 
als homozygous for the substitution (+/+) was approxi- 
mately 30% of the mean activity for (-/-) individuals, 
5 homozygous for the alanine residue. Heterozygotes had 
a mean MTHFR activity that was 65% of the activity of 
{-/-) individuals; this value is intermediate between 
the values for ( ) and (+/+) individuals. The 
ranges of activities showed some overlap for the 

10 heterozygous and (*/-) genotypes, but homozygous (+/+) 
individuals showed virtually no overlap with the for- 
mer groups . A one-way analysis of variance yielded a 
p value <.Q001; a pairwise Bonf erroni t test showed 
that all three genotypes were significantly different 

15 with p<0»01 for the three possible combinations. 

The three genotypes were all significantly dif- 
ferent (p<.01) with respect to enzyme thermol ability . 
The mean residual activity after heat inactivation for 
5 minutes at 46° was 67%, 56% and 22% for the (-/-), 

20 <+/-) and :(+/+) genotypes, respectively. While the 
degree of thermolability overlaps somewhat for (-/-) 
individuals and heterozygotes , individuals with two 
mutant alleles had a distinctly lower range. Every 
individual with the (+/+) genotype had residual activ- 

25 ity <35% after heating, and specific activity <50% of 
that of the <-/-) genotype. 

Total homocysteine concentrations, after fast- 
ing and 6 hours after methionine loading, were meas- 
ured in plasma by high performance liquid chromatogra- 

30 phy using fluorescence detection. Fasting homocysteine 
levels in (+/+) individuals were almost twice the 
value for (+/-) and <-/-) individuals. The differ* 
ences among genotypes for plasma homocysteine were 
maintained when homocysteine was measured following 6 

35 hours of methionine loading. A one-way anova yielded 
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a p < .01 for the fasting and post-methionine homocys- 
teine levels. A pairwise Bonferroni t test showed 
that only homozygous mutant individuals had signifi- 
cantly elevated homocysteine levels (p<.05). 

5 

PCR-based mutagenesis for expression of A-»V mutation 
xn vitro 

PCR-based mutagenesis, using the cDNA-contain- 
ing Bluescript™ vector as template, was used to create 

10 the A to V mutation* Vent™ polymerase (NEB) was used 
to reduce PCR errors. The following primers were 
used: primer 1, bp -200 to -178, sense; primer 2, bp 
667 to 687, antisense, containing a mismatch, A, at bp 
677; primer 3, 667 to 687, sense, containing a mis* 

15 match, T, at bp 677; primer 4, bp 1092 to 1114, 
antisense* PCR was performed using primers 1 and 2 to 
generate a product of 887 bp, and using primers 3 and 
4 to generate a product of 447 bp. The two PCR frag- 
ments were isolated from a 1.2% agarose gel by Gene- 

20 clean™ (BIO 101). A final PCR reaction, using primers 
1 and 4 and the first 2 PCR fragments as template, was 
performed to generate a 1.3 kb band containing the 
mutation. The 1.3 kb fragment was digested with Ncol 
and Mscl, and inserted into the wild-type cDNA- con- 

25 taining expression vector by replacing the sequences 
between the Ncol site at bp 11 and the Mscl site at bp 
94 J. The entire replacement fragment and the cloning 
sites were sequenced to verify that no additional 
changes were introduced by PCR. 

30 

Expression analysis of wild-type and mutagenized cDNA 

Overnight cultures of JM105™ containing vector 
alone/ vector + wild-type MTHFR cDNA, or vector + 
mutageni2ed cDNA were grown at 37° C. in 2 x YT media 
35 with .05 mg/ml ampicillin. Fresh 10 ml. cultures of 
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each were inoculated with approximately 50 of over- 
• night cultures for a starting O.D. of 0.05, and were 
grown at 37° C to an O.D. of 1 at 420 nM. Cultures 
were then induced for 2 hrs. with 1 mM IPTG and pel- 
5 letted- The cells were resuspended in TE buffer with 
2 }jg/ml aprotinin and 2^ig/ml leupeptin (3.5 x wet 
weight of cells) ♦Cell suspensions were sonicated on 
ice for 3 x 15 sec. to break open cell membranes and 
then centrifuged for 30 mins. at 4°C. to pellet cell 

10 debris and unlysed cells. The supernatant was removed 
and assayed for protein concentration with the Bio-Rad 
™ protein assay. Western analysis was performed using 
the Amersham ECL™ kit according to the instructions 
of the supplier, using antiserum generated against 

15 purified porcine liver MTHFR. Enzymatic assays were 
performed by established procedures; thermolability 
was assessed by pre-treating the extracts at 4 6°C. for 
5 rains, before determining activity - Specific activi- 
ties (nmol f ormaldehyde/hr . /mg . protein) were calcu- 

20 lated for the 2 cDNA-containing constructs after sub- 
traction of the values obtained with vector alone ( { to 
subtract background E. coll MTHFR activity). 

The MTHFR cDNA (2.2 kb) (Fig. 6} has an open 
reading frame of 1980 bp, predicting a protein of 74.6 

25 kDa. The purified porcine liver enzyme has been shown 
to have subunits of 77 kDa. Western analysis 
(Fig.._ 7A) of several human tissues and of porcine 
liver has revealed a polypeptide of 77 kDa in all the 
studied tissues, as well as an additional polypeptide 

30 of approximately 70 kDa in human fetal liver and in 
porcine liver, suggesting the presence of isozymes. 
Two ^g of bacterial extract protein was used for lanes 
1-3* The tissues (lanes 4-8) were prepared by 
homogenization in .25M sucrose with protease 

35 inhibitors (2 ^g/ml each of aprotinin and leupeptin), 
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followed by sonication (3 x 15 sec.) on ice. The 
extract! were spun for 15 min. in a microcentrifuge at 
14/000 g and 100 \ig of supernatant protein was used 
for Western analysis. h=human; p=porcine. 
5 The wild-type cDNA and a mutagenized cDNA, con- 

taining the A-»V substitution, were expressed in JE. 
coli to yield a protein of approximately 70 kDa 
(Pig. 7A), which co-migrates with the smaller polypep- 
tide mentioned above. Treatment of extracts at 46 °C 

10 for 5 minutes revealed that the enzyme containing the 
substitution was significantly more thermolabile than 
the wild-type enzyme (p<.001; Fig. 7B), Two separate 
experiments {with 3-4 replicates for each construct 
for each experiment) were performed to measure thermo- 

15 stable activity of the wild-type MTHFR and mutagenized 
MTHFR A— >V cDNAs - The values shown represent mean ± 
standard error for each experiment, as % of residual 
activity after heating. The means of the specific 
activities before heating (expressed as nmol formalde- 

20 hyde/hr,/mg. protein) were as follows: Exp. 1, 3-8 and 
5.3 for MTHFR and MTHFR A-»V, respectively; Exp. 2, 6.2 
and 7.5 for MTHFR and MTHFR A->V, respectively. The 
expression experiments were not designed to measure 
differences in specific activity before heating, since 

25 variation in efficiencies of expression could contrib- 
ute to difficulties in interpretation. Curiously 
though, the specific activity for the mutant construct 
was higher in both experiments. It is possible that 
the mutant pr otein has j n^r^^ ^d stability in E. cold, 

30 or ^ tha t Inclusion bodies in our extracts contributed 
to differences in recovery of properly-assembled 
enzyme . 

These studies have identified a common substi- 
tution in the MTHFR gene which results in thermolabil- 
35 ity in vitro and in vivo. The mutation, in the 
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heterozygous or homozygous state, correlates with 
reduced enzyme activity and increased thermolability 
of MTHFR in lymphocyte extracts- A significant eleva- 
tion in plasma homocysteine was observed in individu- 
5 als who were homozygous for the mutation. Statistic 
cally-signif icant differences for homocysteine levels 
were not observed between heterozygotes and (-/-) 
individuals; this observation is not surprising, since 
plasma homocysteine can be influenced by several envi- 

10 ronmental factors, including intake of folate, vitamin 
B 12' vitamin 85, and methionine, as well as by genetic 
variation at other loci, such as the cystathionine-^- 
synthase gene. 

The alanine to valine substitution conserves 

15 the hydrophobicity of the residue and is associated 
with small changes in activity, in contrast to non- 
conservative changes, such as the previously-reported 
arginine to glutamine change in MTHFR, which is asso- 
ciated with a greater decrease in enzyme activity and 

20 severe hyperhomocysteinemia. The alanine residue is 
situated in a region of homology with the bacterial 
metF genes. We have also observed the same region of 
homology in the human dihydrof olate reductase (DHFR) 
gene (Fig. 11), although the alanine residue itself is 

25 not conserved; this region of amino acids 130-149 of 
DHFR contains T136 which has been implicated in folate 
binding in an analysis of the crystal structure of 
recombinant human DHFR. It is tempting to speculate 
that this region in MTHFR is also involved in folate 

30 binding and that the enzyme may be stabilized in the 
presence of folate. This hypothesis is compatible 
with the well-documented influence of folate on homo- 
cysteine levels and with the reported correction of 
mild hyperhomocysteinemia by folic acid in individuals 
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with premature vascular disease, and in individuals 
with thermolabile MTHFR . 

Although our cDNA is not long enough to encode 
the larger MTHFR polypeptide, it is capable of direct- 
5 ing synthesis of the smaller isozyme. The ATG start 
codon for this polypeptide is within a good consensus 
sequence for translation initiation. Whether the 
isozyme is restricted to liver and what its role is in 
this tissue remain to be determined, 

10 These data have identified a common genetic 

change in MTHFR which results in thermo lability ; our 
experiments do not directly address the relationship 
between this change and vascular disease. Nonethe- 
less, this polymorphism represents a diagnostic test 

15 for evaluation of MTHFR thermol ability in hyperhomp- 
cysteinemia. Large case-control studies are required 
to evaluate the frequency of this genetic change in 
various forms of occlusive arterial disease and to 
examine the interaction between this genetic marker 

20 and dietary factors. Well-defined populations need to 
be examined, since the limited data set thus far sug- 
gests that population-specific allele frequencies may 
exist. More importantly, however, the identification 
of a candidate genetic risk factor for vascular dis- 

25 ease, which may be influenced by nu trient intak e , rep- 
resents a critical step in the design of appropriate 
therapies for the homocysteinemic form of arterioscle- 
rosis. 

3 0 CDNA FOR MTHFR AND ITS POTENTIAL UTILITY 

A human cDNA for MTHFR (2.2 kb ) has been iso- 
lated, as reported by us in Goyette et al. (Nature 
Genetics, 1994, 1:195-200) and Frosst et al . (Nature 
Genetics, 1995, 10:111-113). The cDNA has been 
35 expressed in vitro to yield a MTHFR protein of 



WO 95/33054 



38 - 



PCT/CA95/00314 



approximately 70 kDa (Frosst P et al., Mature Genet- 
ics, 1995, 14:111-113). 

Using the cDNA sequence/ mutations in patients 
with severe and mild MTHFR deficiency (Goyette P et 
5 al., Nature Genetics, 1994, 7:195-200? Goyette P et 
al., Am, J, Hum, Genet., 1995, .56 :1052-1059; Frosst P 
et al., Nature Genetics, 1995/ 1_0: 111-113) were iden- 
tified. 

The cDNA sequence is a necessary starting point 
10 for the detection of MTHFR sequence changes that would 
identify individuals at risk for cardiovascular and 
neurological diseases, as well as other disorders 
affected by folic acid metabolism. Diagnostic tests by 
DNA analysis are more efficient and accurate than 
15 testing by enzymatic/biochemical assays. Less blood is 
required and results are available in a shorter period 
of time. The tests could be performed as a routine 
operation in any laboratory that performs molecular 
genetic diagnosis, without the specialized 
20 Reagents/expertise that is required for an enzyme- 
based test. 

The second major utility of the cDNA would be 
in the design of therapeutic protocols, for correction 
of MTHFR deficiency. These protocols could directly 

25 involve the gene, as in gene therapy trials or in the 
use of reagents that could modify gene expression. 
Alternatively, the therapy might require knowledge of 
the amino acid sequence (derived from the cDNA 
sequence)/ as in the use of reagents that would modify 

30 enzyme activity. The identification of sequences 
and/or sequence changes in specific regions of the 
cDNA or protein, such as FAD binding sites or folate- 
binding sites, are useful in designing therapeutic 
protocols involving the above nutrients. 



35 
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UTILITY OF INVENTION IN CLINICAL AND DIAGNOSTIC 
STUDIES 

Coronary artery disease patients in Montreal 
(n=153) were studied to examine the frequency of the 
5 alanine to valine substitution. Fourteen percent of 
the patients were homozygous for this mutation. An 
analysis of 70 control individuals (free of cardiovas- 
cular disease) demonstrated that only seven % of these 
individuals were homozygous for the alanine to valine 
10 mutation. 

Analysis of homocysteine levels in 123 men of 
the above patient group indicated that the mutant 
allele significantly raised homocysteine levels from 
10*2 micromoles/L in homozygous normal men to 11.5 and 
15 12.7 in heterozygotes and homozygous mutants, respec- 
tively. 

Families with a child with spi na bifida , a 
neural tube defect, have been examined for the 
presence of the alanine to valine mutation. 

20 Approximately 16% of mothers who had a child with 
spina bifida were homozygous for this mutation, while 
only 5% of control individuals were homozygous. 
Fathers of children with spina bifida also had an 
increased prevalence of the homozygous mutant genotype 

25 (10%) as did the affected children themselves (13%). 

Table 4 indicates the interactive effect of 
folic acid with the homozygous mutant alanine to 
valine change. In a study of families from Framingham/ 
Massachusetts and Utah, individuals who were 

30 homozygous mutant but had folate levels above 5 ng/ml 
did not have increased homocysteine levels compared to 
individuals with the normal or heterozygous genotype. 
However, individuals who were homozygous mutant but 
had folate levels below 5 ng/ml had homocysteine 

35 levels that were significantly higher than the other 
genotypes- 
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Table 4 

Mean fasting and PML homo cysteine levels for different 

MTHFR genotypes 



MTHFR genotype 



Plasma Homocysteine 


Normals (-/-) 


Hettrazygotes 
<+/-) 


Homozygotes 


p trend 


N 


58 


61 


30 






9.4 


9.2 


12.1 


0.02 


Folate <5 ng/ml 


10.2 


10.4 


15.2 


0.002 


Folate 2 5ng/mL 


82 


7.5 


7.5 


0.52 


Post-Methtonlne load 


30.0 


30.9 


31.3 


0.62 



5 * Significant interaction between folate levels and genotype (p=0.03) 

Example III provides preliminary data for 
therapeutic intervention by folic acid supplementation 
to individuals who are homozygous for the alanine to 

10 valine change- The data suggest that higher levels of 
plasma folate would lead to normalization of homocys- 
teine levels in mutant individuals and might prevent 
' the occurrence of disorders associated with high homo- 
cysteine levels, such as ca rdiovascular disease , neu- 

15 ral tube defect s , and possibly other disorders. Folic 
acid supplementation for mutant individuals might also 
restore methionine and S-adenosylmethionine levels to 
normal. This would be relevant for disorders that are 
influenced by methylation, such as neoplasias, _devel- 

20 opmental anomalie s, neurologic disease, et c- 

While the invention has been described in con- 
nection with specific embodiments thereof, it will be 
understood that it is capable of further modifications 
and this application is intended to cover any varia- 

25 tions, uses, or adaptations of the invention follow- 
ing, in general, the principles of the invention and 
including such departures from the present disclosure 
as come within known or customary practice within the 
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art to which the invention pertains and as may be 
applied to the essential features hereinbefore set 
forth, and as follows in the scope of the appended 
claims. 
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1- A cDNA probe for human methylenetetrahydro- 

folate reductase ( MTHFR ) , 

2. Use of the probe of claim 1 for the iden- 

tification of sequence abnormalities in patients with 
severe or mild MTHFR deficiency, 

3- The use of claim 2, wherein said deficiency is 

selected from the group consisting of cardiovascular 
and neurological disorders and disorders influenced by 
folic acid metabolism, 

4, Use of the probe of claim 1 for gene therapy to 

produce the MTHFR protein, 

5* A human MTHFR protein which is encoded by the 

probe of claim 1, 

6o Use of the probe of claim 1 and the protein of 

claim 4 for therapy of MTHFR-def iciency patients by 
dietary, biochemical or pharmacological approaches . 
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10 20 30 U 50 

X XX X XI X xi * 

AAI KC GGA GCC ATG GT6 AAC GAA GCC AGA GGA AA{ AGC AGC CH A A C (tt 

T1A AGG CU CGG TAC CAC TIG C T T CGG TCT C C T TIG TCG TCG GAG 7 T G GGG 

As n Ser Gty Ala Me t Val Asn Glu Ala A r 9 Gly Asn S e r Ser leu Asn Pro> 



(0 • 70 80 90 100 

II X X X X II XX 

TGC. 7 T G GAG GG( AGT GC( AGC AG I GG( AG1 GAG AGC HC AAA GAf AGC TCG 

ACG AAC CTt CCG HA CGG ICG' HA CCG HA CH TCG AGG TM CIA HA AGC 

[ys Leu Glu Gly Ser Ala Ser Ser Gly Ser Glu Ser Ser lys Asp Ser Ser> 



no no no ho no 

X X XI X XX X XI 

AGA 101 HI ACC CCG GGC CI& GAC (CI GAG CGG CAT GAG AGA CH CGG GAG 

TCT AC A AGG TGG GGC CCG GAC CTG GGA CTC GCC G1A CH HI GAG GCC CH 

Ar g C y 5 Ser Ttu Pro Gly leu Asp Pro Glu Arg His Glu A r q leu A r 9 Glu> 



HO UO 180 1 90 200 

X I X X XX X XI X 

AAG ATG AGG CGG CGA TIG GAA HI GGI GAC AAG TGG TH TIC CTG GAA TIC 

TH TAC ICC GCC GCT AAC (IT AGA CCA CTG TIC AC C AAG AGG GAC CTT AAG 

lys Mel Arg Arg Arg leu Glu Ser Gly Asp lys Irp Phe Ser leu Glu Phe> 



210 220 230 

I I X X X I 

IK CU CCT CGA ACT GU GAG &GA GU 

AAG GGA GGA GU IGA CGA CH CCI (GA 

Phe Pro Pro Arg Ihr Ala Glu Gly Ala 



740 250 

X X X X X 

GU AAT CH AK KA AGG TIF GAC 

C AG HA GAG TAG AGT KC AAA CTG 

Val Asn leu lie Ser Arg Phe As p > 



2 6 0 21 0 2 8 0 290 300 

XXX * xx t XXI 

CGG ATG GCA GCA GGT GGC CCC CTC TAC ATA GAC GTG ACC TGG C AC CCA OCA 

GCC TAC C GT CGT CCA CCG GGG GAG ATG TAT CTG C AC TGG ACC GTG GGT CGT 

Arg He I Ala Ala Gly Gly Pro leu lyr lie Asp Vat Thr Trp His Pro Ala* 



310 320 330 

X X X x x 

GGT GAC CU GGC KA GAC AAG GAG 

CCA CTG GGA CCG AGT CTG TH CTC 

Gly Asp Pro Gly Ser Asp Lys Glu 



34 0 350 

X I X X X 

ACC TCC TCC ATG AIG AfC GCC AGC ACC 
TGG AGG AGG TAC fAC TAG CGG TCG TGG 
Ihr Ser Ser Met Met He Ala Ser Thr, 

fx=r 7fl 
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3 6 0 3 7 0 3 8 0 3 9 0 WO 

X X XI X lv X X) 

6(C G T G AAC T AC TGI GGC UG GAG AC C AH UG C AC A1G ACC IGC TGC CGI 

CGG C AC HG AIG ACA CCG 0 AC CTC IGG TAG GAC GIG IAC IGG AC6 ACC GCA 

Ala Val Asn T y r C y s G I y leu Glu Ihr lie leu His Mel Thr Cys Cys Arg» 



UO UO UO UO (SO 

X XX X XX X XX X 

(AG CQC HO GAG GAG ATC AC6 GGC CA1 CIO (At AAA GC T AAG (AG UG GGC 

GTC GCG GA( (K CTC TAG TGC (CG GIA GAC GIG TTT CGA U( GK GAC CCG 

Gin Arg Leu Glu Giu lie Ihr Gly His leu His lys Ala lys Gin leu Giy> 



UO 470 C 8 0 WD 500 510 

XX XXX XXXI XI 

CIG AAG AAC ATC ATG GCG CIG CGG GGA GAC CCA AIA GOT GAC (AG IGG GAA 

GAC TTC TIG TAG T AC CGC GAC GCC CCI CIG GGI IAI CCA CIG GU ACC UI 

leu lys Asn lie Met Ala leu Arg Gly Asp Pro He Gly Asp Gin Irp Giu» 



5?0 530 SW 5 S 0 560 ' 

IX X ■ X X X X X X I 

GAG GAG GAG GGA GGC TIC AAC IAC GCA GIG GAC CIG GIG AAG (AC AIC CGA 

CTC CTC CTC CCT CCG AAG TTG AIG CGI CAC CIG GAC (AC TTC GTG TAG GU 

Glu Glu Glu 61? Gly Phe Asn T y r Ala Val Asp leu Val lys His He Arg > 



570 5J0 

X X X X X 

AG T GAG ITT GGI GAC IAC III GAC 

HA CTC AAA (CA CIG AIG AAA CIG 

Ser Glu Phe Gly Asp lyr Phe Asp 



590 WO 610 

X X J X % 

AIC IG1 GIG GCA GGI IAC CCC AAA GG[ 

TAG ACA [AC CGI CCA AIG GGG III CCG 

He Cys Val Ala Gly lyr Pro lys Gty> 



wo wo wo wo wo 

I X X X X X X X X X 

CAC (CC GAA GCA GGG AGC TTT GAG GC I GAC CIG AAG (AC T f G AAG GAG AAG 

GTG GGG (TT CGI CCC TCG AAA UC CGA C TG GAC TIC GTG AAC TTC CTC TTC 

His Pro Glu Ah Gly Ser Phe Glu Ala Asp leu lys His leu lys Glu lys» 



670 



6(0 



690 



700 



710 



GTG TCT GCG GGA GCC GAT IK AIC ATC ACG [AG CIT TTC TIT GAG GC T GAC 
CAC AGA CGC (CT CGG (FA AAG TAG TAG TGC GK GAA AAG AAA CTC (GA UG 
Val Ser Ala Gly Ala Asp Phe He lie Th r Gin leu Phe Phe Gl u Ala Asp ► 
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7 2 0 1 3 0 7U 

X X X XX x 

aca nc nc ccc ni gig aag gu tgc 

IGI AAG AAG GCO AAA C AC TIC CGI ACG 
Thr Phe Phe Arg Phe Vs t lys A I a (ys 



750 760 

x x t x x 

Alt GAC AIG GGC AH AC T TGC [(t 

TGG C 16 T AC CCG TAG TGA ACG GGG 

Thr Asp Mel Gly He Thr (ys Pro* 



770 780 790 BOO 810 

X XI I XI X XX X 

AK GIC CCC GGG A T C III KC Alt CAG GGC I AC (AC ICC CTT (GG (AG (IT 

IAG CAG GGG CCC IAG AAA GGG TAG G7C (CO AIG GTC AGG GAA GCC GIC GAA 

He Vil Pro Gly lie Phe Pro lie Gin Gly lyr His Ser leu Arg Gin leu- 



82 0 8 30 BU 850 660 

xx jr xx x xx i i 

GTG AAG CTG T C C AAG C T G GAG GTG CCA CAG GAG AH AAG GAC GIG AIT GAG 

CAC TIC GAC AGG TTC GAC CTC C AC GGT GTC UC TAG T T C CTG CAC TAA CTC 

Va( lys Leu Ser lys Leu 61 u Val Pro Gin 61 u He lys Asp V a I He Glu> 



870 






81 


30 








900 






910 






X 




X 




X 


1 


X 




X X 








X 


i 




CCA 


AK 


AAA 


GAC 


AAC 


GAT 


GU GCC 


AH 


CGC AAC 


IAI 


GGC 


AK 


GAG 


(TG 


GCC 


GGI 


IAG 


HI 


CIG 


TIG 


CfA 


(GA CGG 


TAG 


GCG TIG 


ATA 


CCG 


IAG 


tK 


GAC 


CGG 


Pro 


He 


lys 


Asp 


Asn Asp Ala Ala 


lie 


Arg Asn 


lyr 


Gly 


He 


01 o 


Leu 


Ala> 


9?0 






no 






no 




950 






960 








X 




X 


R 




> 


X 


X 


X 




X 


X 




> 




GIG 


AGC 


CTG 


TGC 


CAG 


GAG 


CTT CTG 


GCC 


AG T GGC 


IIG 


GTG 


CCA 


GGC 


CIC 
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